AQAScore: Evaluating Semantic Alignment in Text-to-Audio Generation via Audio Question Answering
arxiv.org·1h
Making a Language
thunderseethe.dev·8h
Prosody-Guided Harmonic Attention for Phase-Coherent Neural Vocoding in the Complex Spectrum
arxiv.org·1h
Microspeak: On fire, putting out fires
devblogs.microsoft.com·1d
Introducing multimodal retrieval for Amazon Bedrock Knowledge Bases
aws.amazon.com·1d
Mimestream 1.9.7
tidbits.com·2d
Loading...Loading more...